# High-precision Recognition

Roberta Base Ai Text Detection V1

A fine-tuned model based on RoBERTa-base for detecting AI-generated English text.

Text Classification

Transformers English

Bert Large Uncased Merged

This is a dataset for phishing attack detection, primarily used to train BERT models to identify phishing websites.

Text Classification

Transformers English

Nicpras Finetuned Yolo

A fine-tuned object detection model based on the YOLOv3 architecture, optimized for scenario-specific recognition tasks.

Object Detection

Plant Identification Vit

A plant identification model fine-tuned from Google's Vision Transformer (ViT) architecture, achieving 80.96% accuracy on the evaluation set.

Image Classification

Videomae Large Finetuned Deepfake Subset

A fine-tuned version of the MCG-NJU/videomae-large model, trained on the Deepfake Detection Challenge dataset and used for video deepfake detection.

Video Processing

YOLOv10 is a real-time object detection model that achieves efficient and overhead-free object detection by eliminating post-processing steps such as Non-Maximum Suppression (NMS).

Object Detection
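
For readers unfamiliar with the post-processing step that the YOLOv10 entry above says is eliminated, here is a minimal sketch of classic greedy Non-Maximum Suppression in plain Python. It is illustrative only and is not taken from any YOLO codebase; the function names `iou` and `nms`, the `(x1, y1, x2, y2)` box format, and the default threshold of 0.5 are assumptions made for this example.

```python
from typing import List, Sequence, Tuple

# Assumed box format for this sketch: (x1, y1, x2, y2) with x1 < x2 and y1 < y2.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def nms(boxes: Sequence[Box], scores: Sequence[float],
        iou_threshold: float = 0.5) -> List[int]:
    """Greedy NMS: return indices of kept boxes, highest score first."""
    # Visit candidates from most to least confident.
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep: List[int] = []
    for i in order:
        # Keep a box only if it does not overlap too much with any kept box.
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep


if __name__ == "__main__":
    boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
    scores = [0.9, 0.8, 0.7]
    print(nms(boxes, scores))
```

Each candidate is compared against every box already kept, so this step adds latency that grows with the number of raw detections. That is the overhead an NMS-free detector avoids.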

Detr Face Detection

A face detection model released under the CreativeML-OpenRAIL-M license, with English-language support, used primarily for object detection tasks.

Object Detection

Transformers English

Trocr Base Plate Number

An example vision model for recognizing vehicle license plates, capable of extracting license plate numbers from images.

Text Recognition

Xlm Roberta Base Language Detection ONNX

A language detection model based on XLM-RoBERTa, capable of identifying which language a text is written in.

Text Classification

Donut Cn Invoice

An AI model specialized in Chinese invoice recognition, capable of accurately extracting key information from invoices.

Large Language Model

Transformers Chinese

Convnextv2 Large DogBreed

This model is a fine-tuned version of facebook/convnextv2-large-22k-224 on a dog breed classification dataset, achieving an accuracy of 91.39% on the evaluation set.

Image Classification

Fashion Images Gender Age Vit Large Patch16 224 In21k V3

A Vision Transformer model based on Google's ViT-Large architecture, fine-tuned on a fashion-image gender and age classification dataset, achieving 99.6% accuracy on the evaluation set.

Image Classification

Plant Vit Model 1

A plant image classification model based on the ViT architecture, achieving 99.95% validation accuracy after fine-tuning on an unspecified dataset.

Image Classification

Detr Resnet 101

An end-to-end object detection model based on the Transformer architecture, with a ResNet-101 feature extractor.

Object Detection

A visual model for plant leaf condition classification, capable of accurately identifying and analyzing the health status of plant leaves.

Image Classification

My Awesome Food Model

A food classification model based on Google's ViT model, fine-tuned on the Food101 dataset.

Image Classification

A food image classification model based on Google's Vision Transformer (ViT) architecture, fine-tuned on the Food101 dataset, with an accuracy of 90.9%.

Image Classification

My Awesome Food Model

A food image classification model based on the ViT architecture, fine-tuned on the Food101 dataset, with an accuracy of 89.7%.

Image Classification

Vit Base Highways 2

A Vision Transformer model fine-tuned from google/vit-base-patch16-224-in21k, achieving 70% accuracy on an unspecified dataset.

Image Classification

This is an image classification model based on the Vision Transformer (ViT) architecture, specifically designed for legume recognition tasks.

Image Classification

An image classification model based on google/vit-base-patch16-224-in21k, fine-tuned on the herbier_mesuem5 dataset.

Image Classification

Swin Finetuned Food101

An image classification model based on the Swin Transformer architecture, fine-tuned on the Food101 dataset, achieving an accuracy of 92.14%.

Image Classification

Lmv2 G Aadhaar 236doc 06 14

A fine-tuned version of microsoft/layoutlmv2-base-uncased that specializes in document information extraction, in particular fields such as Aadhaar card number, date of birth, gender, and name.

Sequence Labeling

Swin Finetuned Food101

A food image classification model fine-tuned from the Swin Transformer architecture, achieving 92.1% accuracy on the Food101 dataset.

Image Classification

Resnet 50 Base Beans Demo

An image classification model based on the ResNet-50 architecture, fine-tuned on the beans dataset, achieving an accuracy of 90.23%.

Image Classification

Snacks Classifier

A lightweight image classification model based on Microsoft's Swin Transformer Tiny architecture, achieving 92.86% test accuracy after fine-tuning on a snack classification dataset.

Image Classification

© 2025 AIbase